02/06/2024
Game theory studies strategic interactions
and multiperson decision theory
Defination - Normal form game
A normal form game consists of three things:
A set of 
A set of actions 
And the action profiles: 
Payoffs. 
Example
A game of
, , and 
7,3 3,5 5,5 7,2 The payoff here is (player 1, player 2)
Defination (strictly) dominate
Action 
对所有的player 2 选择的strategy,player 1选择 
Definate weakly dominate
Action 
In summary, strictly domination implies weak domination, and any two actions cannot weakly dominate each other.
And we have some descriptions:
Action 
Action 
Action 
Action 
Then,
A rational player will never chooses a dominated action, and will always chooses a dominant action.
Example
| Player | 2 | ||
|---|---|---|---|
| Player | |||
| 1 | |||
if we do not know the payoffs for player 2, then we have no dominant and dominated actions so far.
Defination - Beliefs
Let 
Tip
Here 
Example
The expected utility 
Given 
Since the set of actions are finite, there is always at least one best action.
Defination - Best response
Action 
Note
The best response correspondence is
Best response could be multiple.
Tip
Example - Sun Rain Game
| Rain God | ||
|---|---|---|
| Player | Sun | Rain | 
| No Umbrella | ||
| Umbrella | 
Let 
Find the best response correspondence given 
Therefore, if 
Otherwise, the choice of 
and if 
Defination - Never a best response
If 
Proposition
A strictly dominated action is a NWBR.
Proof
If action
is strictly dominated by an action , , so , so action is a "Never a best response (NWBR)". 
Tip
NWBR does not necessarily means being strictly dominated by a pure strategy. It could be dominated by a mixed strategy.
Example
| U | D | |
|---|---|---|
| L | 3 | 0 | 
| M | 1 | 1 | 
| R | 0 | 3 | 
There is no strictly dominated action,
let's say 
Let's compare the expected utility given belief of player 2, for action 
When 
When 
SO in this case, 
Why? because M is dominated by a mixed strategy.
A mixed action for player 1 is a probability distribution 
The support of a mixed action is the set of pure actions given positive probability.
The expected utility (payoff) of 
 Suppose 
This means that a mixed action cannot yield a payoff higher than the best pure action in its support, since the paoff of the mixture is a convex combination of payoff of the pure action in the support.
Tip
But the mixed action is also something we needed, since it is conditional on the belief, it could be different conditional on other beliefs.
Warning
mixed strategy as best response
Suppose we extend the defination of the best response to mixed actions.
Then we have a following proposition, if we have 
then every 
Proof as homework
Suppose we have 
hence, 
Therefore, we can show 
That brings a contradiction with 
So, every 
Note
Action 
Proposition
Proof as homework.
We want to show if 
Suppose 
Suppose 
Equilibrium is in undominated strategies. We have already known that a rational player will not play a dominated action, sometimes this fact by itself is enough to give a prediction for the solution of the game
Example
Each player has only one action that is not strictly dominated , so
is the solution. 
Warning
Solution is always a pair instead of a strategy
Example - Sun rain game
There is no strictly dominated action for player 1,
is strictly dominated by , so is dominant. Player 2 will always plays . Then player 1 knows that Player 2 is rational and it will never play a strictly dominated action. Hence we can reduce the game by eliminating the strategy
for player 2. So
is the outcome for this game 
Notice that we obtain this solution by iteratively deleting the strictly dominated actions.
Let's see what happens if we delete weakly dominated actions
strictly dominates , so we can eliminate the strategy for player 1. P1 will never players . 

and can be the solution of the game But if we start with player 2 and eliminated the weak dominated strategy
, then it will only result in one solution 
Warning
Eliminating weakly dominated strategy could be risky of losing solutions, and the final solution depends on the orders of deleting. So we never eliminate weakly dominated strategies.
Consider the following case
P1\P2 D E F A B C Starting P1
is weakly dominated by , is weakly dominated by , and is also weakly dominated Then we have
and  as the solution. 
Nash equlibrium
A pure action profile 
  That is 
A mixed action profile 
That is 
Example
Prinsion Dilemma:
Consider the best response,
for player 1,
, and for player 2, . So
is a Nash equilibrium. We do not have a Minxed strategy nash equilibrium in this example, because strategy
is dominated by both agents, so is a never a best response. So any mixed strategy that has positive probability on  will be dominated as well and becomes a NWBR. 
Example - Sun rain game
Consider this example again.
Player A
, . 
Player B
, . 
So the pure strategy nash equilibrium is
. Now, what about if we have this payoff?
In this new payoff game, we have two pure strategy nash equilibrium
and . And we also have a mixed strategy nash equilibrium.
Proposition
Tip
Any action played positive probability in any NE must survive iterated deletion of strictly dominated actions.
Proof as homework
Suppose there is an action that is played positive probability in a NE
Looking for mixed action equilibrium
Example - Battle of the sexes
Two pure strategy nash equilibrium
, and . Suppose the probability of your partner choosing
is , and your probability of playing is  . If you are mixing then you must be indifferent between
and  which implies conditions on the action that your partner is playing So, for you, the expected util of playing
is , and . Similarly, your partner must be indifferent between
and to be willing to mix. 
. Hence,
is the mixed strategy nash equilibrium in this example. 
,Example - matching pennies
This is a zero sum game:
for all and . In this case, we do not have any best response to best responses. So we do not have any pure strategy nash equilibrium. But this does not imply that we cannot find any mixed strategy equlibrium, we need to check it.
Suppose
is a mixed action NE. Then,
. Similarly, we can calculate
. Therefore, we can find a mixed strategy nash equilibrium
. 
Tip
Even if we cannot find a pure strategy nash equilibrium, it is possible to find a mixed strategy.
Typically we will find an odd number of nash equilibriums.
Example - Cournot duopoly
There are two firms each of whom chooses a quantity.
. Let , The payoffs are
is the price of , which is inverse demand, and is the cost of production. Let
, , for each and , The best response corresponces are
Suppose there exists an interior solution,
FOC:
Therefore,
and
otherwise. Then substitute it back to the b est response
This is a simultaneously move game
Question
Suppose that a game has two NE. Suppose one of the NE involves weakly dominated actions and the other doesn't, which one is less likely to be played?
Or suppose one of them is Pareto dominated by another NE. which one is less likely to be played?
Let's have an example for illustration,
| L | R | |
|---|---|---|
| T | ||
| B | 
The PSNEs are 
And we know that 
And if we see the both players' strategies, we know that 
Then if we consider the 1st question, say maybe we choose 
Tip
Theorem
Any finite game has at least one NE.
Proof as homework
Here is a short self-contained proof.
We will define a function 
Given a mixed strategy profile 
The expected utility of player 
For 
Clearly, 
with equality achived only if 
JR Theorem 7.2 Proof
Proof: Let 
We shall show that a Nash equilibrium of 
Step 1: Define 
Let 
Step 2: Because the numerator defining 
Step 3: Because 
or
Multiplying both sides of this equation by 
Now, a close look at the left-hand side reveals that it is zero, because
where the first equality follows because the 
But the sum on the right-hand side can be zero only if 
Theorem 7.2 is quite remarkable. It says that no matter how many players are involved, as long as each possesses finitely many pure strategies there will be at least one Nash equilibrium. From a practical point of view, this means that the search for a Nash equilibrium will not be futile. More importantly, however, the theorem establishes that the notion of a Nash equilibrium is coherent in a deep way. If Nash equilibria rarely existed, this would indicate a fundamental inconsistency within the definition. That Nash equilibria always exist in finite games is one measure of the soundness of the idea.
Recall the battle of sexes game,

We know that the two PSNEs are 
Suppose the man need decide first, then 

Integrents of extensive form game
Players 
Nodes: 
Decision nodes 
Terminal nodes 
Directions
predecessors and successors
Each node (except for initial node 
Terminal nodes have no successor.
The game tree is the initial node 
A subtree is any node and its successors
Whose decision function
Actions: 
Each different act8ion leads to a different successor node.
Information sets
A player cannot distinguish between nodes in an information set but can distinguish between information sets.
The same players at each decision node in an information set, if 
The same actions are available at each decision node in an information set, i.e. if 
Nature's moves: nature moves randomly according to commonly known probability
Payoffs 
Properties of extensive form game
A game has perfected recall if players don't forget anything during the game
A game has perfect information if each information set contains only one node: that is if playersknow the history of moves so far.
For example in this game we have four information sets, three of them contains more than one nodes, so this game is not a perfect information game. It is simultaneously move game, which means any player does not know the other player's action.

A game has complete information if the structure of the game is common knowledge (each player could draw the game tree).
we will usually transform incomplete information games into imperfect information games by letting Nature choose the structure randomly and unobservably.
Important
Complete info and Perfect info
Strategies and payoffs
In extensive form game, we will distinguish strategies and actions. we don't do that for normal games.
Let 
The pure strategy for 
Let 
A strategy is a complete contingent plan of actions.
A pure strategy specifies a player's choice of action of each of her information sets, even for sets that are not reached.
Tip
Example: In this game we have 3 information sets for 2 players.

Then we define Mixed strategy ,
A mixed strategy 
For example, 
We know that we can transform extensive form game to normal form game by writting strategies and their corresponding payoffs. Then using this table we can calculate the mixed strategy nash equilibrium.
A Nash equilibrium of an extensive form game is a NE of the corresponding normal form game.
Refinement of NE
Backward Induction
is the process of analyzing a game from end to begin.
Example
A parent is driving Disneyland with a child in the back seat. The child is making a lot of noise. Parent says "be quiet, or I will turn this car around." This situation is represented in the following extensive form game, where the child's payoffs are given first.
Write down the norm form game:
TT TD DT DD Q -10,-5 -10,-5 5,10 5,10 N -5,-10 10,5 -5,-10 10,5 There are three pure strategy nash equilibrium here in the norm form game,
. By back ward induction, the remaining strategy profile is
. The strategy profile
is a NE. But is th e parents actually willing to turn back at node (left lower node)? No So is that a credible threat? No
Conditional on reaching node
, Parent's BR is Disneyland. The same is true at node . Anticipating these decisions, child know that if he chooses quiet he will end up with payoff of 5 and if he choose noise he will end up with payoff of 10. The strategy profile
is the backward induction solution to this game and it's also a NE. 
Important
Proposition
Every backward induction solution of an extensive form game is NE.
Every finite perfect information game has at least one backward induction solution in pure strategies. Thus every such game has a PSNE.
But backward induction solutions may involve weakly dominated strategies.
Example
Consider this game:
, . Consider the following normal form game:
C D A 1,1 1,1 B 1,1 0,0 
and are both solutions to this game, but is weakly dominated. 
Example
This is an imperction game, so backward induction cannot be applied. BUT we can still eliminate strategy
because it is strictly dominated. If we change the payoff to this scheme:
We need subgame perfect
Defination
A subgame is a sub tree such that
Starts at a decision node and
Contains no broken information set
That is if an information set contains a node in the subgame, then every node in that information set is contained in that subgame.
Example:
This game has 2 subgames. One starts from
and the other starts from . 
Defination - SPNE
A subgame perfect equilibrium is a profile of strategies where restrictions to any subgame forms a NE of that subgame.
Note
Every subgame perfect equilibrium is a NE (To the whole game), if a game has only one subgame, which is itself, then every NE is a subgame perfect Nash Equilibrium (SPNE).
Tip
In games of perfect information (we do not have any info set that contains multiple nodes, i.e. no simultaneous move game), the set of subgame perfect equilibria and the set of Backward induction are the same
Our backward induction is actually deriving a SPNE.
The advantage of subgame perfection is that it is defined even for games with imperfect information.
TO find a SPNE in an imperfect information game, the procedure is similar to backward induction. we just replace subgames at the end of the game with their equilibrium payoffs, and repeat until you reach the initial node.
Example
There are two subgames in this game.
For the second subgame, the normal form is,
P3 L R Player 2 T 0,1 1,0 B 1,1 0,0 
is the unique aNE of subgame strategies from . Then the payoff at node
degenerates to , then apply backward induction, palyer 1 will choose , and the SPNE is . Next, consider the following exercises: find all the Nash equilibriums in this game (Hint: rewrite this game into a normal form game, we typically need two matrixes)
When
is played by P1, 
L R T 1,10,10 ✅ 1,10,10 ✅ B 1,10,10 1,10,10 ✅ When
is played by P1, 
L R T 0,0,1 0,1,0 😂 B 2,1,1 ✅ 0,0,0 
Example
In this game, by backward induction, the SPNE is
. but we do have other Nash equilibriums. 
Let's add a couple of actions to the battle of sexes.

We have two pure strategy nash equilibriums.
Now, let's augment some of the choices, adding 
| O | B | C2 | |
|---|---|---|---|
| O | 2,1 | 0,0 | 6,0 | 
| B | 0,0 | 1,2 | 0,0 | 
| C1 | 0,0 | 0,0 | 5,5 | 
Still, we have the same original two nash equilibrium. Adding 
But consider the game 
The payoffs from 
There exists a SPNE in which 
, and in the second stage
They represents your strategy in the first and second stage,
The following strategy form a SPNE of 
We consider this repeated game using SPNE, in the second stage, the SPNE is 
First, note that this strategy profile is a Nash equilibrium of the whole game. If man follow the equilibrium he gets payoff 
For the woman, she does not has any incentive to deviate. She plays a statistic best response in both periods.
The equilibrium is a SPNE because both 
Now, lets define this game formally.
Defination
Given a normal form game 
A history 

Like 
A pure strategy for player 1 is 
The outcome of a pure strategy profile 
Like the previous example,
, We define like
, . Then we can also write in this way:
. 
The value 
Definations
The payoff of player 
like,
. It will sometimes be convenient to rescale these payoffs so that they are directly comparable do the stage game payoff
From now, we do not asuume
. If
, then, , and . if
. say
 
Tip
Finitely repeated games
Consider prisoner's dilemma. we repeat it for 
| C | D | |
|---|---|---|
| C | -1,-1 | -4,0 | 
| D | 0,-4 | -3,-3 | 
There is a unique NE for stage game.
Here becasue each subgame only have one nash equilibrium, so when we do backward induction, we can degenerate the game tree always with a certain equilibrium. So there will be no room for punishment to deviate from the nash equilibrium at each stage.
Formally, we say in any subgame perfect equilibrium of 
Proposition
Suppose that stage game 
if the stage game has more than one NE, then the finitely repeated game has more than one subgame perfect equilibrium. For example, for each 
Given a normal form game, let 
The minmax payoff 
That is the other players are minimizing the value of 
The minmax strategy that they (the other players) use against 
Proposition
For any 
Note
Proof idea
Player 
Tip
Example

Since 
For 
Tip
Example

Think about the mixed strategy.
To find 
if 
which means that 
Infinitely Repeated Games
The principle of optimality (also known as the one shot deviation principle) is most useful when we deal with in finitely repeated games Principle of optimality
A strategy profile 
The principle of optimality makes checking for subgame perfection much easier by reducing the number of deviations that we need to check. The idea is that if there is any profitable deviation, then there must be profitable one-shot deviation.
Tip
Example

Strategy:
if no one has deviated, Man's equilibrium continuation payoff is 
Since any deviation results in the same future play, Man's best deviation is his best static deviation is 
Then the payoff is 
Then we look at the woman's equilibrium continution, woman's best deviation is either 
In any history after a deviation, man's equilibrium continuation payoff is 
Refinement of SPNE

In this game we have two SPNE, pure strategy, 
But for player 2, 
So, even if 
We ought to require that at nontrival info sets, players maximized expected utility according to some beliefs about which node they are at.
Tip
Example

There are two subgames.

We know that 
Let decision node 
Tip
Understanding in this way:

In our analysis, we allow nature to be the the first player in the games in which we have nature. Given the strategy profile 
Defination
[Blelief] An assesment is strategy/condtional belief pair 
Note
A system of beliefs 
A system of beliefs specify, for each information set, a probabilistic assessment by the player who moves there, conditional upon play having reached that information set.
Let 
[Sequentially rational] An assessment 
That is 
Note
A formal statement
A strategy profile 
for all 
An assessment 
it is sequential rational , and
belief 
Note
Proposition
if 
Proof by intuition
Suppose 
Therefore, denoting by 
Hence, given belief 
This inequality condition violates the defination of sequential rationality. So in that case, if 
Therefore,  if 
Tip
Example

In order to find PBE, we know it must be a subset of SPNE, so we can first find the set of SPNE.
In this example, we only find one SPNE, so it must be part of PBE.
And suppose player 2 know that player 1 will always choose 
When 
Another example,

we know that in this game 
We know that 
if 
Tip
Example

In this game, player 2 only knows the player 1 chooses 
We only have one subgame,

Each of the part, 

The NE of this game is 
They are SPE,
Then,,
Homework

Consider the following sealed -bid first price auction,
Sealed-bid, no knowledge about the other participants
first price: the bid with highest price wins.
single object
only two allowable bids 
two risk neutral bidders (maximizes expected payoff)
Bidder 
Each bidder views her valuation and then simultaneously and independently submits bid of either 
higher bidder wins and pays her bid
ties broken by fair coin.
[Solution]
There must be a threshold 
At 
Suppose 2 bids according to 
the payoff 
then we can calculate the expected payoff for player 1, 
And, 
Then player 1 is indifferent vetween bidding 
and player 2 is indifferent between bidding 
Defination
Let 
   
Standard Auction Formats
First price auction: Each bidder 
Second price auction: highest bidder wins and pays the second highest bid.
English auction: Bidder dynamically submit successively higher bids. Final bidder wins and pays her final bid.
Dutch auction: Auctioner starts at a high price, successively announces lower price until someone bids. The lowerst bidder wins and pays the current price;
Homework
Intuitively explain the difference and simularities between Dutch, English and first price.
[Solution] of the sealed bid second price auction
Consider the following cases,
| Bidder | Bidder | Bidder | |
|---|---|---|---|
Playing Bidder 
A seller owns one unit of an indivisible good, the good is worthless for the owner unless she can sell it. There are two buyers, 1 and 2. Assume that buyer's valuation of the good is private information. Available information for the players 
Consider the first price sealed bid auction. Buyers strategies depends on their valuation.
Let 
A BNE is a pair of strategies such that
 Then, 
Then,
Then the maximization problem becomes,
 FOC: 
Then, 
Then 
Then the optimal bidding strategy is 
The expected payment of winner: 
The expected gain for the seller: 
Homework:
Find the expected payment of winner, expected gain of winner, and expected seller gain under second price sealed bid auction.
The optimal strategy for each bidder is to bid
, if we assume each bidder's valuation Suppose there are only two bidders in this auction. Denote
as two players' valuation. 
expected payment of winner
Expected gain of winner
Expected seller gain
A worker's random type or ability 
Firm complete over wages to hire the worker. The firms observe education 
 The cost of attaining education level 
Assumption on 
Payoff to a worker of type 
The profit to the firm is
We look for symmetric PBE in pure strategies, That is we look for 3 functions,
To be PBE, the functions must satisfy the following conditions,
To simplify the notation, let 
There are two types of PBE,
Pooling 
Separating 
Let's discuss the separating equilibrium first
Tip
In any separating perfect Bayesian equilibrium, 
This is because upon firm seeing the education level 
Similarly, upon seeing high education level 
Then for low type workers, they will have